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Abstract. We give a new proof of the Cobham's first theorem using ideas 
from symbolic dynamics and of the Cobham-Semenov theorem (in the primi- 
tive case) using ideas from tiling dynamics. 



1. Introduction 

Given a subset E of N ^ {0, 1, 2, • ■ • } can we find an elementary algorithm which 
accepts the elements of E and rejects those that do not belong to E ? 

By "elementary algorithm" we mean a finite state automaton. This question orig- 
inates from the work of Biichi (1960, [Bu| ) . Cobham gave two answers to this 
question. In 1969 he proved that the existence of such an algorithm deeply de- 
pends on the numeration base, more precisely : 

Cobham's first theorem. Letp andq be two multiplicatively independent integers 
greater or equal to 2. Then, a set E G N is both p-recognizable and q-recognizable 
if and only if E is the finite union of arithmetic progressions. 

Here "p-recognizable" means that there exists an automaton which accepts exactly 
the language consisting of the expansions in base p of the elements of E, and p and 
q are multipUcatively independent if whenever "p'^ = for some non-negative 
integers k and 1" then fc = ^ = 0. Note that finite sets are finite unions of arithmetic 
progressions (of length 1) and hence are p-recognizable for all p. 
For example, we will see that the set {2"; n e N} is 2-recognizable and, as it is not 
a finite union of arithmetic progressions, it can not be 3-recognizable. Cobham's 
Theorem implies that the set {2n; n G N} is p-recognizable for any p E N. But 
it does not tell us anything about the structure of recognizable sets of integers. 
Cobham gave a second answer which gives a complete description of their structure : 

Cobham's second theorem. A set E C N is p-recognizable if and only if its 
characteristic sequence x g {0, 1}^ (xi = 1 if and only if i € E) is the image by a 
letter to letter morphism of a fixed point of a substitution of constant length p. 

The original proof of the Cobham's first Theorem was considered by S. Eilenberg in 
[Ei] as "almost elementary" but " highly technical" , the technicality being concen- 
trated in the direct implication, the reverse implication being much less difficult. 
In 1974, Eilenberg suggested in [Ei] to find some more readable proofs. Hansel gave 
in [Hal] some ideas in this matter and pointed out that the proof consists of two 
steps: 1) E is syndetic, 2) completion of the proof. All the other proofs and all the 
proofs of the generalizations follow the same dichotomy. 

Later Muchnik introduced the key concept of definable sets and, using the formalism 
of first order logic, gave a more comprehensible proof of Cobham's first theorem. 
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In 1993, Michaux and Villeniaire [MVj used these definable sets to prove what 
is known as the Cobham-Semenov theorem. This theorem was originally proved 
by Semenov in [Sel and is a multidimensional generalization of the Cobhani's first 
theorem. 

Other characterizations were given in terms of congruences with finite index [Ei] 
and in terms of algebraic power series [CKMRj . Due to all these characterizations, 
many generalizations of Cobham's first Theorem can be stated (see [BHMV for a 
very nice overview or |AS| 1. In this work we are interested in two other directions : 
the dynamical systems and the tilings. 

In Section [2] we give the general background of this paper. We will see in this 
section that Cobham's first theorem can be stated using dynamical systems. This 
will enable us to give a new proof of this theorem prompted by ideas in |HZ| 
and [DHS| . Section [3] is devoted to the computation of the frequencies of the 
substitution language words. It is done in details in order to make the computation 
in the higher dimensional case easier. This will be used to prove Cobham's first 
theorem (in Section |4] for the primitive case and in Section [5] for the general case). 
The way we treat the primitive case strongly differs from what is done in |Du2j 
(where the substitutions are not necessarily of constant length). In [Du2| we use 
what is called "derived sequences" (see also [Dulj ) and here we use the frequencies 
of words. The interest of these two sections is that in the constant length context 
the proof is easier to read and to understand, and it makes the generalization to 
higher dimensions more comprehensible. This is done in Section [6l It gives a new 
proof of the Cobham-Semenov theorem in the primitive case. We will use ideas 
coming from tiling dynamical system theory and more specifically some results in 
[Soli ISo2| . The main argument we use consists of the description of the frequencies 
of some patterns that occur in the tilings we consider. 

2. Automata, substitutions, numeration and dynamical systems 

2.1. Numeration systems and automata. Let A be a finite alphabet. An au- 
tomaton over A, A = {Q, A, E, I, T), is a directed graph labelled by elements of 
A where Q is the set of states, / C Q is the set of initial states, T C Q is the set of 
terminal states and EcQxAxQ is the set of labelled edges. If (p, a, q) € E, we 
write p q . The automaton is finite if Q is finite. A path in the automaton is 
a sequence P = ((p„, a„, g„); < ti < N) where g„ — Pn+i for all < n < — 1. 
We say it is admissible if po belongs to / and belongs to T. We call L{A) the 
set of all admissible paths. The label of P is the word apai • • ■ a^. The set L{A) 
of labels of admissible paths is called the language of A. A subset L of A* is said 
to be recognizable by a finite automaton if there exists a finite automaton A 
such that L — L{A). 

A numeration system is a strictly increasing sequence of integers 



C/ = (C/„;n6N) 



such that 



(1) t/o = 1, 

(2) the set 




; ri G N > is bounded. 
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Let U — {Un;n G N) be a numeration system and c be the upper bound of 
1%^''^ € n|. Let Au be the alphabet {0, ••• ,c' — 1} where c' is the upper 
integer part of c. Using the EucUdean algorithm we can write every integer a; in a 
unique way as follows 

X = aiUi + ai^iUi-i H h aoUo] 



i is the unique integer such that Ui < x < t/j+i and Xi = x, a 
j S {!,••• where aj is the quotient of the Euclidean division of xj by Uj 
and Xj-i the remainder, and oq = xq. We will say that pu{x) = ai - ■ -ao is the 
[/-representation of x and we set 

L{U) = {0>[/(a:);n € N,x £ N}. 

We say a set S C N is [/-recognizable if the language 

0*Pu{E) = {0>c/(x);n e N,xeE} 
is recognizable by a finite automaton. 

We will focus on the standard numeration systems U = {p^;n S N) where p is a 
positive integer greater or equal to 2. For these numeration systems, we say that 
E is p-recognizablc and we set pu ~ pp and U = Up. 

Let us give some examples to illustrate these notion. Let 

El = {2n; n e N}, E2 = {2"; n € N} and 



|n e N;^ei =0[2],p2(n) =efe---eieo| 



We have 0*p2(£^i) = {wO; w e {0, 1}*}, 

0*p2{E2) = {0"10™;n,m e N} and 



0>2(£^3) 



'„G{0,ir;^i 



0[2] 



i=0 



Hence these sets are 2-recognizable, respectively, by the following automata. 








0,1 





Here an arrow going into a state means that this state is an initial state and an 
arrow going outside of a state means that this state is a terminal state. 
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The set Ei being an arithmetic progression, Cobham's first Theorem asserts that 
El is 3-recognizable. We have 0*p3iEi) = {wo---Wn € {0, 1, 2}*; X;r=o = ^M} 
and the automaton recognizing this set is the following. 





0,2 



Let us now consider the multidimensional case. Let d>l. We write (a;i, . . . , Xd) € 
N"^ in base p as a tuple of words, on the alphabet {0, 1, ... ,p — 1}'', with the same 

length by adding leading zeroes if needed. For example, in base 2 we write ^ 
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We say C N"* is ^recognizable if E written in base p is 



recognizable by a finite automaton. 

Cobham-Semenov theorem. Let p and q be two multiplicatively independent 
integers greater or equal to 2. Then, a set E c N** is both p-recognizable and 
q-recognizable if and only if E is semilinear, 

where semilinear means that there exist n G N and finite sets Vt CN'^, <i <n 
such that 



E = Vo[ji y ^ i;N 



l<i<n veVi 

2.2. Words, sequences and morphisms. We call alphabet a finite set of ele- 
ments called letters. Let A be an alphabet, a word on A is an element of the free 

monoid generated by A, denoted by A* , i.e. a finite sequence (possibly empty) of 
letters. Let be a word, its length is n and is denoted by 

The empty word is denoted by e, |e| = 0. The set of non-empty words on A is 
denoted by A+. If J = [i,j] is an interval of N = {0, 1 ■ • • } then xj denotes the 
word XiXi+i • ■ - Xj and is called a factor of x. Analogous definitions hold for open 
or semi-open intervals. We say that xj is a prefix of x when i = and a suffix when 
j = n ~ 1. If u is a factor of x, we call occurrence of ?i in x every integer i such 
that a;[j^j_)_|„|_i] = u. Let u and v be two words, we denote by Lu{v) the number of 
occurrences of m in 

The elements of A^ are called sequences. For a sequence x = (x„; n E N) = xqXi ■ • • 
we use the notation xj and the terms "occurrence" and "factor" exactly as for a 
word. The set of factors of length n of x is written i„(x), and the set of factors of 
X, or language of x, is represented by L(x); L(x) = U„gNL„(x). The sequence x is 
periodic if it is the infinite concatenation of a word v. A gap of a factor m of x 
is an integer g which is the difference between two successive occurrences of u in x. 
We say that x is uniformly recurrent if each factor has bounded gaps. 
Let A, B and C be three alphabets. A morphism r is a map from AtoB*. Such 
a map induces by concatenation a map from A* to B*. If t{A) is included in B+, 
it induces a map from A^ to S^. All these maps are written t also. 
To a morphism r, from A to B*, is associated the matrix = {mij)iQB,jeA 
where ruij is the number of occurrences of i in the word T{j). To the composition 
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of morphisms corresponds the multiplication of matrices. For example, let ti : B 
C* , T2 : A ^ B* and T3 : A C* he three morphisms such that ri o T2 = T3 (we 
will quite often forget the composition sign), then we have the following equality: 
Mt^Mt^ = Mi-g. In particular, if r is a morphism from A to A* we have M,-!. = M" 
for all non-negative integers n. 

2.3. Substitutions. A substitution on the alphabet ^ is a morphism a : A ^ A* 
satisfying: 

(1) There exists a G A such that a is the first letter of a{a)\ 

(2) For aU 6 e A, lim„_+oo W{b)\ = +00. 

Note that Condition ([T]) is not always required in the literature about substitutions. 
The language of a is the set L(a) consisting of all the words having an occurrence 
in some cr"(6), n g N and b & A. 

hi some papers (see |Pa| for example) the condition ^ is not required to be a 
substitution and our definition corresponds to what Pansiot call growing substi- 
tutions in Pa . 

It is classical that ((T"(aa • • • ); n g N) converges in to a sequence x. The substi- 
tution a being continuous on ^ this sequence is a fixed point of cr, i.e (t(x) = x. 
Whenever the matrix associated to r is primitive we say that r is a primitive 
substitution. It is equivalent to the fact that there exists n such that for all a 
and 6 in ^, a has an occurrence in (t"(6). Note that in this case L{a) = L(x) for 
all fixed points x of a. It is a substitution of constant length p if for all a G A 
the length of a{a) is p. We will say a sequence x is generated by a substitution 
of constant length p if it is the image by a letter to letter morphism of a fixed 
point of a substitution of constant length p. These sequences will be also called 
p-substitutive in this paper. Note that usually the definition of p-substitutive 
sequences is more general (see [Du5| for example). 

Let B be another alphabet, we say that a morphism </> from A to B* is a letter to 
letter morphism when (^{A) is a subset of B. Then the sequence (/i(x) is called 
substitutive, and primitive substitutive if r is primitive. The matrix of r is 
non- negative and consequently has an eigenvalue a G R such that : If /3 is any other 
eigenvalue then < a (this is a consequence of Perron-Frobenius Theorem, see 
[HJj ). We will say that it is the dominant eigenvalue or the Perron eigenvalue 
of T and that (/)(x) is a-substitutive. 

Let us explain how Cobham's first theorem can be stated using substitutions. First 
we need to remark that a set i? C N is a finite union of arithmetic progressions 
if and only if its characteristic sequence is ultimately periodic. Hence, due to 
Cobham's second theorem, it is an exercise to prove that Cobham's first theorem 
can be formulated in the following equivalent way : 

Cobham's first theorem (Substitutive Version) Let p and q be two multi- 
plicatively independent integers greater or equal to 2. Let A be a finite alphabet and 
x e A^ . Then the sequence x is generated by both a substitution of constant length 
p and a substitution of constant length q if and only if it is ultimately periodic. 

In the substitutive context, the two steps of the proof we mentioned before consist 
of proving that the letters of A appear with bounded gaps in x and then to conclude. 
Cobham's second theorem asserts that for each of these sets we can find some 
substitutions of constant length that generate their characteristic sequences. We 
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labelled the states to construct these substitutions. Let the set of states be the 
alphabet A of the substitution. The image of the state a is the word '(/;o ■ • -w^ai-i 
where Wi is the state you reach starting from a and passing through the arrow 
labelled by i. The substitutions we obtain for Ei (in base 2), Ei (in base 3), E2 
and Es are : 

ai : a ^ ab ai : a ^ aba 01: a ^ ab as : a ^ ab 
b ^ ab b ^ bob b ^ be b ^ ba 

c ^ cc 

Take one of these substitutions and call x its unique fixed point starting with the 
letter a, then, identify to 1 the terminal states and to the other states. We obtain 
the characteristic sequence y of the corresponding set of integers E. For S3 we 
obtain the well-known Morse sequence y : 



X 


= a 


b 


b 


a 


b 


a 


a 


b 


b 


a 


i 






















y 


= 1 








1 





1 


1 








1 
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E3 


: 
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2.4. Dynamical systems and subshifts. By a dynamical system we mean a 
pair {X, S) where X is a compact metric space and S a continuous map from X to 
itself. We say that it is a Cantor system if X is a Cantor space. That is, X has a 
countable basis of its topology which consists of closed and open sets and does not 
have isolated points. The system {X, S) is minimal whenever X and the empty 
set are the only 5-invariant closed subsets of X. Wc say that a minimal system 
{X, S) is periodic whenever X is finite. We say it is p-periodic if #(X) = p. 
Let {X, S) and {Y, T) be two dynamical systems. We say that {Y, T) is a factor of 
{X, S) if there is a continuous and onto map (j) : X ^ Y such that (j) o S ~ T o (p (^ 
is called factor map). If is one-to-one we say that </> is an isomorphism and 
that {X, S) and {Y, T) are isomorphic. 

In this paper wc deal with Cantor systems called subshifts. Let A be an alphabet. 
We endow A^'* with the topology defined by the metric 

'^(x>y) = ^ with n = inf{|A;|;Xfc ^ yfc}, 

where x = (x„; n G N) and y = (y„; n G N) arc two elements of A^ . By a subshift 
on A we shall mean a pair (X, T|x) where X is a closed T-invariant {T{X) = X) 
subset of A^ and T is the shift transformation 
T : AN ^ AN 

(x„;nGN) (x„+i;nGN). 
We call language of X the set L{X) — {x[ij]; x G X,i < j}. Let u be a word of A*. 
The set 

[v]x = {x G X;X[oj„|_i] = u} 
is called cylinder. The family of these sets is a base of the induced topology on 
X. When it will not create confusion we will write [u] and T instead of [u]x and 
T\x- 

Let X be a sequence on A and f2(x) be the set {y G A^; y^^j^ g i(x), V C N}. It 
is clear that (0(x), T) is a subshift. We say that (f2(x), T) is the subshift generated 
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by X. We remark fl{x) — {T"-x; n e N}. Let {X, T) be a subshift on A, the following 
are equivalent: 

(1) {X,T) is minimal. 

(2) For all X G X we have X 

(3) For all X G X we have L{X) = L(x). 

We also have that (ri(x), T) is minimal if and only if x is uniformly recurrent. Note 
that if (y, T) is another subshift then, L[X) = L{Y) if and only if X = F. 
For primitive substitutions cr, all the fixed points are uniformly recurrent and gen- 
erate the same minimal subshift, we call it the substitution subshift generated 
by a and we denote it {X„^T) (for more details see |Qu| .) 

There is another way to generate subshifts. Let L be a language on the alphabet 
A and define Xl C to be the set of sequences x = (x„)„gN such that each word 
of L(x) appears in a word of L. The pair {Xl,T) is a subshift and we call it the 
subshift generated by L. If cr is a primitive substitution, then Xa- = Xl where 
L = {(T"(a);a G A}. . 

It is easy to show that if x is an ultimately periodic sequence and (il(x),T) is 
minimal, then x is periodic. From all the remarks we made before it is not surprising 
that the following theorem holds : 

Theorem 1. Let a and r be two primitive substitutions with dominant eigenvalues 
a and (3 respectively. Suppose a and [3 are multiplicatively independent. Then, 
{X„,T) is isomorphic to {Xt,T) if and only if they are periodic with the same 
period. 

The definition of multiplicative independence of real positive numbers is the same 
as the one we gave before for positive integers : two positive real numbers a, (3 
are multiplicatively independent if whenever a" — /J™, n,m G Z, this implies 
n = m = 0. Remark that it is equivalent to the fact that is an irrational 
number. 

We will prove this theorem later and show why it implies Cobham's first theorem. 
A key notion in this proof is the notion of return words. 

2.5. Return words. For the rest of the section x is a uniformly recurrent sequence 
on the alphabet A and (X, T) is the minimal subshift it generates. We recall that 
all sequences in X are uniformly recurrent. Let w be a non-empty word of L{X). 
A word w on j4 is a return word to m in x if there exist two consecutive occurrences 
j, fc of M in X such that w = X[j,fe)- The set of return words to u is denoted by 7^„(x). 
It is immediate to check that a word w G A'^ is a return word if and only if: 

(1) uwu G L{x) (i.e. uwu is a factor of x); 

(2) M is a prefix of wu] 

(3) the word wu has only two occurrences of u. 
Remarks. 

(1) As X is uniformly recurrent, the difference between two consecutive occur- 
rences of u in X is bounded, and the set 7?.m(x) of return words to u is 
finite. 

(2) The previous statement (2) cannot be simplified: it is not equivalent to u 
is a prefix of w. For example, if aaa is a factor of x then the word a is a 
return word to aa. 
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(3) From this characterization, it follows that the set of return words to u is 
the same for all y E X, hence we set TZu{X) — TZu{x). 
If it is clear from the context, we write TZu instead of 7?.„(x). 

3. Some useful properties of the substitutions 
In this section we develop the tools we will use to prove Theorem [T] 

3.1. Return words of a substitutive sequence. We say that a sequence x on a 
finite alphabet is linearly recurrent (with the constant ii' e N) if it is recurrent 
and if, for every word w of x and all w G TZu it holds 

< K\u\. 

We say {X, T) is a linearly recurrent subshift if it is a minimal subshift that 
contains a linearly recurrent sequence. 

Proposition 2. All primitive substitutive sequences, and the subshifts they gener- 
ate, are linearly recurrent. 

Proof. Let r be a primitive substitution and x one of its fixed points. We begin 
proving that there exists a constant C such that for all positive integers k 

(3.1) Sk = max{|r'=(a)|;a e A} < Cmin{|T'=(a)|; a e A} ^ Ch ■ 

For all k we choose some letters au and bk such that \T^{ak)\ = Ik and |r'^(6fe)| = Sk. 
By primitivity there exists fco such that for all a,b € A the letter b has an occurrence 
in the word T^°{a). We set C — \T^"{bko)\- For k > ko we have 

Sk = Ir^ibk)] = |t'=° p-'=°(6fc))| < C|T'=-'=°(6fe)| <C\T>'{ak)\ = Ch. 

Let M be a word of £(x) and w be a return word to u. Let k be the smallest integer 
such that Ik > The choice of k entails that there exists a word ab G L{x) of 
length 2 such that u occurs in [ab) . Let R be the largest difference between two 
successive occurrences of a word of length 2 of L{t). It follows 

\w\ < RSk < RCIk < RCSih-i < RCSi\u\ . 

□ 

When the substitution a is primitive we will also say that cr is linearly recurrent 
with some constant. 

Proposition 3. Let x £ A^ be an non-periodic linearly recurrent sequence for the 
constant K . Then: 

(1) The number of distinct factors of length n of y. is less or equal to Kn. 

(2) X is [K + \) -power free (i.e. G -^(x) */ and only ifu = e). 

(3) For all u G L{x) and for all w G TZu we have [1/ K)\u\ < \vu\. 

(4) For all u G L{x), #TZu < K{K + if. 

Proof. We start with a remark. Let n be a positive integer and u £ L{x) a word of 
length {K + l)n — 1. Let v G L{x) be a word of length n. The difference between 
two successive occurrences of v is smaller than Kn, consequently u has at least one 
occurrence of v. We have proved that: For each n, every words of length n has at 
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least one occurrence in each word of length {K + l)n — 1. From this remark we 
deduce (1). 

Let u G L(x) be a word such that e L{x). Each factor of x of length \u\ 

occurs in But in occurs at most |u| distinct factors of length |u| of x. 

This contradicts the non-periodicity of x. (We recall that if for some n a sequence 
y G has at most n different words of length n, then it is ultimately periodic, see 
[HM].) 

Assume there exist u G L{x) and w G TZu such that > \w\. The word w is a 

return word to u therefore m is a prefix of wu. We deduce that is a prefix of u. 
Hence belongs to L(x) because wu belongs to L{x). Consequently w — e and 

(3) is proved. 

Let u be a factor of x and v G L{x) be a word of length {K + Each word of 

length (K + occurs in v, hence each return word to u occurs in v. It follows 
from (3) that in v will occur at most K{K + l)^\u\/\u\ — K{K + 1)^ return words 
to u, which proves (4). □ 

3.2. Perron Theorem and frequencies of the letters. The following well- 
known theorem is fundamental to prove the existence and compute the frequencies 
of the words in fixed points of substitutions. The proof can be found in [HJ' or 
[LM]. 

Theorem 4. Let M be a d x d primitive matrix. Then : 

(1) The matrix M has a positive eigenvalue 9 which is strictly greater than the 
modulus of any other eigenvalue; 

(2) The eigenvalue 9 is algebraically simple; 

(3) To this eigenvalue corresponds an eigenvector with positive entries. 

(4) There exist < r < 9 and C such that for all i,j G {1, . . . , d} and all n G N 
we have 

\M^^ - r,lj9''\ < Cr'', 

where (ri, . . . , r^i) and (^i, . . . , I a) are respectively the unique right and 
left eigenvectors satisfying 

(3.2) ^ Ta = 1 and ^ rJa = 1- 

Let a : A ^ A* he & primitive substitution, M its matrix and x one of its fixed 
points. The eigenvalue 9 of the previous theorem will be called the Perron eigen- 
value of M or a. These real numbers are called Perron numbers. We take the 
notations of the previous theorem. 

For a word u G L(x) we call frequency of u in L{x) the limit (when it exists) 
freq^(u) = lim _ ■r^# {O < i < |w| - |u| - 1; u = } . 

\v\^oo,v^L{x) \V\ 

We recall that for all n and all a,bui A we have |cr"(6)|Q = (M")a,h. Consequently 
from Perron theorem we obtain, for all ti G N, 
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(3.3) ||a"(5)U-r,4r| <Cr'\ thus 

(3.4) ||a"(6)| - <(#A)Cr" and 

(3.5) lk"(6)|a - r,|a"(6)|| <C7(1 + #A)r". 



We set C = C(l + #^). We fix a e A. Now we prove that freq^(a) exists. Let 

V G L{a). There exist n, and, words Vi (0 < i < n) and Wi (0 < i < n) such that 

(1) l^il ^ and Iwil < L for < i < n where L = max^g^ |o'(^)h 

(2) Vn is non-empty; 

(3) V = WoCr(wi) • ■ •Cr"~H^^«-l)c^"(«n)cr"~^("'n-l) ' ' ' (wi)wo. 

Moreover from ((O)) there exists a constant C" > such that |cr"(u)| > C"|u|6'" 
for all u e i(cr). Hence 

\\v\a-ra\v\\<2C' ''''^' ~^ <C"'\vr, 
r — 1 

for some constants C" ^C" , where a — log r/ log 6* < 1. This means that freq^(a) 
exists and is equal to Tq. 

3.3. Substitutions of the words of length n and frequencies of words. Here 
cr : A — > A* is a primitive substitution. In this section we prove the frequency of 
words exists for primitive substitutions. We use the previous subsection and the 
following substitutions. 

Let fc > 1 be an integer and Ak be the set of words of length k belonging to L{a); We 
consider Ak = {(u); u £ L{a), \u\ — k} as an alphabet and we define a substitution 
(Jk '■ Ak A"^ in the following way: For (u) G Ak with (t{u) = u = ui . . . Vm and 
p — \a{ui)\, we put 

(^kiiu)) = (^'[l,fe])(^'[2,fc+l]) • • • iv[p^p+k~i])- 
In other words, (Jk{u) consists of the ordered list of the first |(t(ui)| factors of length 
k of a{u). Remark that, for every n > 1, cr^ is associated to cr" in the same way 
as cTfe is associated to a: ^^^((u)) consists of the ordered list of the first |cr"(Mi)| 
factors of length k of (t"(u). In particular we have: 

(3.6) K((«))| = |ct"(^i)|. 

If n is large enough, every v G L{a) of length A; is a factor of cr"(a) for every 
a E A; Thus, (v) G Ak occurs in cr^'((u)) for every (u) G Ak- We proved that ak is 
primitive. 

Let w be a word of length n > over the alphabet Ak- From the definition of L{a) 
and L((7k) it can be checked that : w G L{ak) if and only if there exists a word 

V G L{a) of length n + A: - 1 such that w = (w[i,fc])(w[2,fe+i]) • . ■ {v[n,n+k~i\)- Clearly, 
given (u) G Ak, the number of occurrences of the symbol {u) in w is equal to the 
number of occurrences of u in w. And consequently 

(3.7) freq^(u) = freq^^((u)). 

Finally applying the results of the previous subsection to Ok for all k we obtain the 
following result. 
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Proposition 5. For all u G L{a) there exist constants hc(\^[u), D and a < \ such 
that for all v G L{a) of length greater than \u\ we have 

\\vU - ireqAu) {\v\ - \u\ + l)\ < D i\v\ \u\ + if . 
Corollary 6. The frequency of u exists for all u G L{a). 



In [HZj is proved the following theorem which is central in the present paper. They 
use the recognizability property of primitive substitutions. Wc will give a different 
proof not using this property but Proposition [31 

Theorem 7. Let 9 be the Perron eigenvalue of a. There exists a finite set F dM. 
such that for all n there exists fc G N satisfying 

{freq^(u); u G L(cr), |u| = n} C {s0~''; s G F} . 

Proof. We suppose a is linearly recurrent with the constant K. Let 62 be the Perron 
eigenvalue of 172. From (13. 6p and Perron Theorem we deduce that 62 — 6. Let M2 
be the incidence matrix of 02- From (13. 7p we know that freqg.(u) = freq^.,-, ((u)) for 
all u G A2- Subsection 13.21 and Theorem |4] imply that (freq^^ ((u)); (u) G A2) is the 
unique right eigenvector of il/2 (for the eigenvalue 62) with X)(«)eA2 (('^)) ~ 

Let C be the constant defined as in (|3.ip . Let u G L{a) be a word of length n and 
k the smallest integer such that 

\u\ — n < Ik ^ mill |o''°(a)| 

aGA 

Let B be the set of words {ah) G A2 such that u has an occurrence in a*' {ah). The 
choice of k implies this set is non-empty. Let {ah) G B and M = max{|iT(c)|, c G A). 
From Proposition [3] it follows that 

\u\/K \u\ 
Moreover from (13. 4p we have that 

i^"";<°)i.<,> 

and from Proposition [SI for all c E A, 

freq,(u) = Jmi^ |^.n+fc(c)| " 
Let be the last word of length two of a"^{c). Then, 

W^+Hc)\u EateA, {\'jHab)\u - k-"(c)|„fa + \<jHb')\u 

|o-™+'=(c)| |cr™+'=(c) 

k'"(c)|a6 |a™(c)| , |a'=(6')|u 



^ (|a'=(a5)|„-|a'=(6)U)freq,(a5)0-^ 
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Consequently, it suffices to take 



^^{Y. Jabiyeq,{ab);jab e [0, 2KMC] n N, a6 G ^2 [ 
which is a finite set. □ 



4. Cobham's theorem for minimal substitutive systems 
In this section we prove the following theorem. 

Theorem 8. Let a and t be two primitive substitutions whose dominant eigen- 
values are respectively a and (3. Suppose that {X(j,T) and (Xt,T) has a common 
factor (y, T) . IfY is non-periodic then a and (3 are multiplicatively dependent. 

Theorem [T] is clearly a corollary of Theorem \8\ 

4.1. Preimages of factor maps of LR subshifts. Let be a factor map from 
the subshift {X,T) on the alphabet A onto the subshift (Y,T) on the alphabet B. 
If there exists a r-block map / : A^^^^ — > B such that {(j){x))i — f{x[i-r,i+r]) for 
alH G N and a; e X, we shall say that / is a block map associated to 0, that / 
defines cj) and that </> is a sliding block code. The theorem of Curtis-Hedlund- 
Lyndon (Theorem 6.2.9 in [LMj ) asserts that factor maps are sliding block codes. 
If u = uqUi ■ ■ ■ is a word of length n > 2r + 1 we define f{u) by {f{u))i ~ 
f{uii,i+2r]), j e {0, 1, • • • , n - 2r - 1}. 

Let C denote the alphabet ^^''+^ and Z — {{{x[^r+i,r+i])',i e N) G C'';(a:„;n £ 
N) G X}. It is easy to check that the subshift {Z,T) is isomorphic to {X,T) and 
that / induces a 0-block map from C onto B which defines a factor map from {Z, T) 

onto(y,T). 

The next lemma was first proved in IDu4| . 

Lemma 9. Let {X,T) be a non-periodic LR subshift (for the constant K) and 
(y, T) be a non-periodic subshift factor of {X,T). Then (Y,T) is LR. Moreover, 
there exists ni such that: For all u G L{Y) with \u\ > ni we have 

(1) \u\/2K < \w\ < 2K\u\ for all w G 7^„(y) ; 

(2) i^{nu{Y))<2K{2K+lf. 

Proof. We denote by A the alphabet of X and by B the alphabet of Y . Let 
(j) : {X, T) — > (y, T) be a factor map. Let / ; A'^'''^^ Bhe a. block map associated 
to (j). 

Let It be a word of L{Y) and v G L{X) be such that f{u) — v. We have |u| = |v|— 2r. 
If w is a return word to u then \w\ < max{|s|; s G TZv} < K\v\ < K(\u\ +2r). Then, 
the subshift (Y, T) is linearly recurrent with the constant K{2r+ 1). Moreover: For 
all u G L{Y) such that |m| > ni — 2r, and for all w G TZu, \w\ < 2K\u\. To obtain 
the other inequality it suffices to proceed as in the proof of Proposition [3) 
Let u G L{Y) with \u\ > ni and v G L{Y) be a word of length {2K + 
Each word of length {2K + occurs in v, hence each return word to u occurs 
in V. It follows from the previous assertion that in v occurs at the most 2K{2K + 
iy\u\/\u\ = 2K{2K + 1)2 return words to u. □ 

Proposition 10. Let {X,T) be a non-periodic LR subshift (for the constant K). 
Let (j) : (X, T) — > (Y, T) be a factor map such that (Y, T) is a non-periodic subshift 
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and f : A^^^^ B be a r-block map defining (p. Then there exists no such that for 
all u G Y , with \u\ > no, we have 

W{{u}))<4K{K+l). 

Proof. Let m be the integer given by Lemma [9l We set no = max(2r + l,ni). 
Let u G L{Y) such that |m| > no. The difference between two distinct occurrences 
of elements of f^^{{u}) is greater than \u\/2K. Moreover f~^{{u}) is included in 
n ^1"!+^'" and each word of length (i^ + l)(|u| + 2r) has an occurrence of each 
word of L{X) n Therefore 

This completes the proof. □ 

4.2. Frequencies in the factors. Let cr be a primitive substitution with dominant 
eigenvalue a and linearly recurrent constant K, and, {Y, T) a non-periodic factor 
of (XctjT). Let : F be a factor map and / be a r-block map that defines 

<j). From Theorem [7] we know there exists a finite set C M such that for all n 
there exists fc e N satisfying 



(4.1) {freq,(w); w e L{X,), \v\ ^ n} C s £ F,} . 

Let u G L{Y), \u\ = TO, and set f~^{{u\) = C L\^\^_\_2r{X„) with 

I < AK{K + 1) (Proposition [TU|) . Let k be as in (jil^ for n^\u\ + 2r. Let y e Y 
and X e X^r such that (f){x) — y. We remark that 

11 ^™ r A# {0 <i <\v\~\u\;u^ V[u+\u\-i] } 

\v\~*oo, veL{Y) \v\ I . I I J J 

exists and is equal to 

lim ^ ■p-T# {O < i < |w| - |u| 2r; W[i.i+|„|+2r-i] e {vi, •■• 
We denote it freqy(u). Moreover, 

I 

beqyiu) = ^freq^^K) £ {s'^^s' G F^} , 

1=1 

where F^ is the finite set {Eif/^"^^' /»; e F^, 1 < i < 4/^(74- We proved: 

Theorem 11. Let 9 be the Perron eigenvalue of the primitive substitution a. There 
exists a finite set F C M such that for all non-periodic subshifi factor {Y, T) of 
{Xcr,T), and all n there exists k gN satisfying 

{freq^(u);u G L{Y), \u\ = n} C {fO"; / G F} . 
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4.3. Proof of Theorem [HI From Theorem [Til we know there exist two finite sets 
Fa and Ft such that for all n G N there exists fc, /c' e N satisfying 

{freqy(M);w e L{Y), \u\ = n} C {sa'^;s e S^] n |s/5''';s e . 

Using Statement (1) of Lemma[9|we have that freqy(M) < 2K/\u\ for all u e LiY). 
Then, freqy(u) tends to when \u\ goes to infinity. Consequently, there exist 
w, u e L{Y), s e Sa,t e Sr, k,k' eN, k', and 1,1' eN,l ^ I', such that 

sa'^ = freqy(M) = and sa'" = freqy(w) = tp' . 
We obtain that a'' which ends the proof. 

5. Cobham's theorem for substitutions of constant length 

In this section we prove the substitutive version of Cobham's first theorem. The 
arguments presented below also work for a wide class of non-constant length sub- 
stitutions that are called in |Du5| "good substitutions". Wc first prove that the 
letters appear with bounded gaps. Then, we could conclude using the previous 
section. The sufficient part is easier to establish. 

5.1. Letters and words appear with bounded gaps. The following theorem 
is a key argument in what follows. 

Theorem 12. Let a and (3 he two multiplicatively independent positive real num- 
bers. Then the set 

fa" 1 
< —\n,m G N > 
\/3™ J 

is dense in R+. 

Proof. See [HW]. □ 

Let p and q be two multiplicatively independent Perron numbers. Let a (resp. t) 
be substitution of constant length p (rcsp. q) defined on the alphabet A (resp. B), 
with fixed point y (resp. z). We suppose there exist two letter to letter morphisms 
(j): C and ip : B C such that 0(y) = V(z) = x. 

Lemma 13. Let a E A be a letter which has infinitely many occurrences in x. 
There exist a positive integer I, a word u £ A* and v,w G A* such that for all 
n G N the word 

a'"(u)fT'("-i) (t;)a'("-2) . . . 

is a prefix of y. 

Proof. Let a G A be a letter that has infinitely many occurrences in x. We set 
flQ = a. There exists ai G ^ which has infinitely many occurrences in x and 
such that flQ has an occurrence in a{ai). In this way we can construct a sequence 
{afyi G N) such that oq = a and occurs in cr{ai^i), for all z G N. 
There exist i, j with i < j such that a; = aj = b. It follows that a occurs in ct* (6) and 
b occurs in a^~^{b). Hence there exist Mi,U2,fi,f2 G A* such that <7^{b) = uiau2 
and a^~^{b) = Vibv2. We set p = j — i, v — (T*(t;i) and w — u\. There exists u 
such that u 6 is a prefix of x. We remark that for all n G N the word u" [u b) is a 
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prefix of x too. We set u — a^{u ). We have aP{u b) — aP{u )vibv2- Consequently 
for all n G N 

^P"(y')crP(«-i)(wi)af("-2)(vi) . ■■aP{vi)vib 
is a prefix of a'^P{u b). Then 

Crf»(u)crP("-l)(t;)crP("-2) („)... (jP{v)vwa 

is a prefix of (7"p+*(w b) and consequently of x, for all n G N. □ 

Proposition 14. The letters of C which have infinitely many occurrences in x 
appear in x with bounded gaps. 

Proof. Let c G C which has infinitely many occurrences in x. Let X = {n G N; x„ = 
c} and A = {a G A;(j){a) = c}. Assume that the letter c does not appear with 
bounded gaps. Then there exist a & A with infinitely many occurrences in y and 
a strictly increasing sequence n G N) of positive integers such that the letter c 
does not appear in (/)((tP" (a)). 

Let u € A* such that ua is a prefix of y. Of course we can suppose that u is 
non-empty. 

For all n G N we denote r2„ C A the set of letters appearing in ctP" (a) . There exist 
two distinct integers ni < n2 such that 17„j — il„2 . Let be the set of letters 
appearing in crP"2-P"i (ri„J. It is easy to show that V, = = iln2- 
Consequently the set of letters appearing in (tP"2-P"i is equal to and for all 
fc G N the set of letters appearing in (TP"i+'"''-P"2~P"i)(a) is equal to Q. Wc set 
/ — Pni and g — ~ Pni ■ We remark that the letter c does not appear in the 
word (j){af+^3(^a)) and that 



(5.1) [\af+''^u)\, \af+''3{ua)\[ nX = 0, 

for all fc G N. Let e be such that |m|(1 + e) < lua|(l — e). Thus, a being of constant 
length p we have 

(5.2) \u\ = < 1^1(1 + e) < |„a|(i - e) < J^^ 

From Lemma [T51 applied to t we have that there exist s,t,t <E B* and ft, G N such 
that for all n G N 



(5-3) V" (y[|T''"(s)T'»("-i)(t)---r''(t)tt'ii) — 

Put 7 = |s| + |t|/(g'' - 1). We have 

lim |r'™(s)T'*("-i)(i)---T'*(i)tt'|/9''" = 7- 

m — *+oo 

From Theorem[T2]it follows that there exist two strictly increasing sequences of inte- 
gers, {mi\i G N) and (n^; i G N), and I G M such that the sequence [jq"^^^ Ip^^a+f^ 
tends to ^ G ]|u|(l -|- e), |ua|(l — e)[. Hence, 
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From ()5.2|) and ()5.4|) there exists i e N such that 

|cr"'f+/(u)| < |T''™'(s)T''('"'-l)(t)---T'^(t)tt'| < \(7'''S+f(^ua)\, 

but |r'""'(s)T''("'-i'(i) • ■■T''{t)tt\ belongs to X by JHSI). This gives a contradic- 



tion with ((57T|) . □ 
We obtain the same conclusion for the words : 

Proposition 15. The words having infinitely many occurrences in x appear in x 
with bounded gaps. 

Proof. Let u be a word having infinitely many occurrences in x. We set |u| = n. 
To prove that u appears with bounded gaps in x it suffices to prove that the letter 
1 appears with bounded gaps in the sequence t G {0, l}*^ defined by 

ti = I if X[,j^j+„_i] = u 

and otherwise. In the sequel we prove that t is p and g-substitutive. 

The sequence y^"^ = ((Vi ■ ■ ■ Yi+n-i); « G N) is a fixed point of the substitution 

(7„ : An A* , where An is the alphabet A", defined for all (ai . . . a„) in An by 

cr„((ai . . .On)) = (5i . . .bn){b2 ■ ■ ■ 6„+i) ...{bp.. .6p+„-i) 

where a{ai) — bi . . .bp (for more details see Section V.4 in Qu| for example). 
Let p : An A* he the letter to letter morphism defined by p{{bi ■ ■ ■ 6„)) = bi 
for all (5i • • -bn) £ An. We have p o an = cr o p, and then MpM^^ = M^Mp. 
Consequently the dominant eigenvalue of cr„ is p and y^") is p-substitutive. 
Let / : An {0, 1} be the letter to letter morphism defined by 

f{{bi ■ ■ ■ bn)) ~ 1 it bi ■ ■ ■ bn = u and otherwise. 

It is easy to see that /(y*-"') — t hence t is p-substitutive. 

In the same way we show that t is ^'-substitutive and Proposition [T3] concludes the 
proof. □ 

5.2. Conclusion for the necessary part of the substitutive version of Cob- 
ham's first theorem. We keep the notations of the previous subsection. 

Lemma 16. There exist k £ N and A £ A such that a^^ is a primitive substitution. 
Proof. The proof is left to the reader. □ 

Let A and B be given by Lemma 1161 for respectively a and t. We remark 
that (jf^r is a substitution of constant length p'^ and t]- a substitution of constant 

length qK Let {Xp,T) and [Xq^T) be the substitutive dynamical systems they 
generate. Each element of (f){L{(j'^£j) and ip{L{T^^^)) appears infinitely many times 

and consequently with bounded gaps in x. Hence (j){L(a'^r)) = ipiLir!^)), we caU it 

L. Let (X, T) be the dynamical system L generates. We remark that 4> : Xp X 
and -0 '■ Xq X defined factor maps. Theorem [5] implies {X, T) is periodic. Hence, 
there exists a word u such that L = L{u'^) C L{x) where |u| is the least period. 
There exists an integer N such that all the words of length |u| appear infinitely 
many times in xjvx^v+i • • ■ • We set t = xatXjv+i ■ • • and we will prove that t is 
ultimately periodic and consequently x will be ultimately periodic. 
The word u appears infinitely many times, consequently it appears with bounded 
gaps. Let TZu be the set of return words to u. It is finite. There exists an integer 
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N' such that aU the words w G TZu H i(xjv'XAr'+i • • • ) appear infinitely many times 
in X. Hence these words appear with bounded gaps in x. We set t' = kn'X-n'+i ■ • ■ 
and we will prove that t' is periodic and consequently x will be ultimately periodic. 
We can suppose that « is a prefix of t'. Then t' is a concatenation of return words 
to u. Lot w bo a return word to u. It appears with bounded gaps hence it appears 
in some u" and there exist a suSix r and a prefix s of w, and an integer i such that 
wu = ru^s. As |u| is the least period of L it follows that wu = u^. It follows that 
t' = uuu ■ ■ ■ . 

5.3. Proof of the sufficient part. 

Proposition 17. Let x be a sequence on a finite alphabet A and p>2 an integer. 
If X is ultimately periodic then it is p-substitutive. 

Proof. Suppose x = u'v'v'v' .... It is easy to sec that x = uvvv . . . where \u\ = 
\v\ G plj for some well chosen u and v. We set \u\ = pi. Take the alphabet 
B = {ao, ai, a2, . . . , a2p«-i} and define the substitution a : B ^ B* hy 



We call y the fixed point of a starting with ao- Now we define a letter to letter 
morphism (j) : B —> Ahy 



6. Multidimensional substitutions and Cobham-Semenov theorem 

In this section we prove the necessary part of the substitutive version of the 
Cobham-Semenov theorem in the primitive case. 

6.1. Arrays and patterns. Let ^ be a finite alphabet. We call array in N'^ any 
function T : N'^ ^ A. The collection of all these arrays is that we endow with 
the product topology of the discrete topologies. 

A non-empty set C = { , . . . , C N'' of pairwise distinct elements is called 
a configuration. A C-pattern P is a map from C to A. A C-pattern P of T is 
defined as the function Tq^^ : C ^ A with 7^^^("x ) = T("? -|- 'v') for all Ic G C, 

for some V G N"^. When "if = we set Tc = 7^ -q> . The set C is the support of P 

and is denoted by supp(P). We set ^| = #C. An occurrence of the C-pattern 
P in T is a vector V G M'* such that P = Tq^. In this situation we will say that P 
has an occurrence in T or P appears in T. The C-language of T is defined to be 
the set Lc{T) of C-patterns having an occurrence in T: Lc{T) = {7^ ^; V G N''}. 
When P' is a pattern we define in the same way the notion of occurrence of P in 
P' and the related notions. _^ _^ 

We will write the elements of N'' as j = (ji, j2, . . . , jd); we set \j \ = maxi<i<d\ji\- 
The cube located at ^ e N'^ of side P G R+ is the set B(f , R) = {k e N''; < 
ki — ji < R,i = 1, . . . ,d}. A cubic- pattern P is a C-pattern where C = B{ j , R) 
for some P > and j . We also say P is a P-cubic-pattern. The 1-cubic-patterns 





Then it can be checked that x = which achieves the proof. 



□ 
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can be viewed as letters of A; when it is clear from the context we will identify 
them. 

Let X be a subset of A^^ and P be a pattern. We denote by [P]x the subset of X 
consisting of the elements T € X such that %uppP = P- When P is a cubic-pattern 
with suppP = P( , R), we say [P]x is a cubic-cylinder. 

A subset Z of N'^ is syndetic if there exist R such that for each cube C of side R 
the set Cn2' is non-empty. We say T is syndetic if 7^ = T~^({a}) is syndetic for 
all a ^ A. We say T is uniformly recurrent if for every pattern P there is P > 
such that P appears in every P-cubic-pattern of T. It clearly fits the definitions of 
the one-dimensional case. 

6.2. Periodicity and definability. We say T is periodic (resp. ultimately 
periodic) if there exists k E such that T{ k + j ) = T{ j ) for all j e N"* 
(resp. for all large enough j ). We will also need another notion of periodicity. We 
say Z C N"* is j -periodic inside X C N'* if for any it E X with li + j E X we 
have 



We will say that Z is locally periodic if there exists a non-empty finite set V of 
non-zero vectors w S N"* such that for some K > max{|v|; v E V} and L > one 
has: 

(VT" e N'', > L){3lt E V){Z is V - periodic inside B{f, K)) 

Remark that for d = 1, local periodicity is equivalent to the ultimate periodicity. 
We say T is pseudo-periodic if for all a E A, Ta is locally periodic and every 
{d — l)-sections of Ta, S{i,n) = {j E Ta;ji = n}, 1 < i < d and n e N, is 
pseudo-periodic (ultimately periodic when d—1 = 1). We say P C N"^ is definable 
whenever its characteristic function is pseudo-periodic. The following criterion is 
due to Muchnik (see BHMV for the proof). 

Proposition 18. The following are equivalent: 

(1) T is pseudo-periodic; 

(2) For all a E A, there exists n E N and finite sets Vi C N'^, Q < i < n such 



6.3. N'^-subshifts. The translation of T E A^ by j e N'* yields the array T + j 
whose k -th element is 



This is also called the N'^-shift; it shifts the array so that the element that used to 
be at i is now at . A N'^-subshift is a pair [X, AN ) where X is a closed subset 
of which is invariant by all translations under j E N''. We say (X, ) is 
minimal if all orbits are dense : for all T G X, 

[T + E n'^}^X. 

In this case all elements of X share the same patterns. We denote the set of all 
these patterns L{X) (the language oi X). 



u E Z li and only if u + j E Z. 



that 




{T+ j){k) = T{k + j). 
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6.4. Multidimensional substitutions. Let p be a positive integer and A a finite 
alpiiabet. A p-substitution (or substitution if we do not need to specify p) is a map 
S : A^p wliere Bp = 3(0 ,p) = nf^^{0, ■■■ ,p-l}. The substitution S can 
be considered as a function from into itself by setting 



S{S'^ ^{a)) for all a £ A and n > 0. To 5 is associated its incidence matrix 
M{S) e A4axa(Z+) defined by 



We say S is primitive whenever ^^(5*) is primitive (i.e. there exists k such that 
M{S)^ > 0). This means that for every a & A there exists h & A such that a 
appears in S^{b). When S is primitive there always exist Tq G j^'^ and / > such 
that S^{Tq) = Tq. We say T is generated by a p-substitution if there exist 
a letter to letter morphism and a fixed point Tq of a p-substitution such that 
T= cl)o%. 

Let X be the subset of A^^ consisting of the arrays T verifying that for each 
subblock B of T there exist j G N and a ^ A such that i? is a subblock of S^{a). 
It can be checked that X is closed and translation invariant. The action of N"^' by 
translations on X is a N'^-subshift called substitution sub shift or N'^-subsh ift 
generated by a p-substitution. The orbit closure of Tq, {To + G N''}, is 

not necessarily equal to X. But it is the case when S is primitive. 

6.5. Substitutive version of the Cobham-Semenov theorem. In this sub- 
section we "translate" the Cobham-Semenov theorem in terms of substitutions to 
obtain an equivalent statement we will prove in the primitive case. In |CG] the 
authors proved the following theorem which is analogous to Cobham's second the- 
orem. 

Theorem 19. Let p > 2 and d> 1. Let E C N''. The following are equivalent. 

(1) E is p-recognizahle, 

(2) the characteristic function of E is generated by a p- substitution. 
Hence we can reformulate the Cobham-Semenov as follows. 

Cobham-Semenov theorem (Substitutive version). Letp and q be two multi- 
plicatively independent integers greater or equal to 2. Then, the array T is generated 
by both a p- substitution and a q- substitution if and only if E is pseudo-periodic. 

We will prove the necessary part of this result in the primitive case using N"*- 
subshifts and the same succession of arguments as for d = 1. 

In the sequel S' : A — > A^ is a primitive substitution, with D = {0, 1 . . . , ^ — 1}''. 
We set 9 = 1'^. It is not a restriction to suppose that S has a fixed point Tq. Let 
[X, N'*) be its associated N''-subshift and M G A4axa(Z^) be its incidence matrix. 




We remark that S^{a) 



#{fc eB.p-S{b)^ 



a}. 
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6.6. Linear recurrence. A proof of the following result for d = 1 can be found in 
[DHS| and for d > 1 in So2]. Both proofs use the same kind of arguments. 

Proposition 20. Let (X, N'') be a minimal substitution subshift. There exists a 
constant K > such that for every cubic pattern P € L{X), any cubic pattern 
P' G L{X) satisfying \P'\ > K\P\ contains a translated copy of P. 

Proof See [So2]. □ 

This property is called linear recurrence for d = 1 in [DHS ] , strong repetitivity 
in [So2j and linear repetitivity in [LPj for d > 1. We choose to say linearly 
recurrent with the constant K. We say To is linearly recurrent. As in the one- 
dimensional case, all tilings of X have the same property. Hence we also say that 
(X, N**) is linearly recurrent. 

The following result is an extension of the assertion (3) of Proposition [3] to the 
multidimensional case. But the proof is not an extension of the proof of Proposition 
|3l In Proposition [3] we use the fact that if for some n a sequence y G has at 
most n different words of length n, then it is ultimately periodic (see [HMj ). For 
d> 2 such a result does not exist. 

Proposition 21. Let {Y,'N'^) be a non-periodic linearly recurrent f^'^ -subshift. The- 
re exists K' such that for any Ic ^ ~y , "z" € N"^ and any cubic pattern P, if ~x and 
~lj are two different occurrences of P and B{~z ,r) C supp(P) then \ x — > . 

Proof. See [So2]. □ 

6.7. Frequencies of the patterns. For every a G A and every pattern P we 
define 



|iS'"(a)|p = number of occurrences of P in 5" (a). 
We call frequency of the pattern P in T the limit (when it exists) 

freq7-(P) = lim — 7# { k G i?( , n); k is an occurrence of P in t| . 

n — ^oo ri ^ J 

In [Sol| the author proves that self-similar tiling systems are uniquely ergodic for 
the action of R'' by translations. The array Tq can be viewed has a self-similar 
tiling. Then, Theorem 3.3 of |Sol| implies that, for all patterns P, the frequency 
freq.2-(P) exists for all T and 

freqr(P) = freqg-„(P) = lim '"^"("^l^ 



\S-{a)\ ■ 

There is a direct proof following the ideas in |Qu| to obtain the unique ergodicity 
of these N''-subshift. The frequency being independent of T we denote it freq5(P). 
Let us call Lr^S) the set of i?-cubic patterns having an occurrence in some iterates 
of 5. 

Theorem 22. There exists a finite set P C M such that for all R E N there exists 
fc G N satisfying 



{freqs(P); P G LniS)} C {/^'^ / G P} 
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Proof. Let K' be the constant given by Proposition [2TJ Let P E Lr{S) be a 
i?-cubic-pattern. We define k to be the smallest integer such that l'^^^ < R . 
Let L C L2{S) be the set of 2-cubic patterns such that P has an occurrence in 
S'^{B). The choice of k implies this set is non-empty. Let B E L. From Proposition 
[21] it follows that 



We recall for all c £ v4 we have 

frec,s{P)=2^J-Js;!lT^- 

Remark that for all m 



I^^+'WIp = E \S"'{c)\BNik,P,B) 



where 



Then, 



BeL 



N{k,P,B) < ma.x\S''{B)\p < 2'^K"^e. 



|5'm+fe(c)| |S'"+'=(c)| 

\S"^{c)\b |5™(c)| 



J2N{k,P,B 



^7V(fc,P,S)freqs(i?)0-^- 



'm — >oo 

B£L 

We conclude as in the one-dimensional case. □ 

6.8. Factor maps. Let (X, N'') and (F, N^') be two N''-subshifts defined respec- 
tively on the alphabets A and B. We say (X, N'') is a factor of {Y, N'^) if there 
exists a continuous and onto map cj) : Y ^ X such that 0(T -I- Ij^) = (j){T) + 1? for 
all T and V. We say (j) is given by a i?-sliding block code if there exists a map 
/ : B^(°'R^ ^ Asuchthat (/.(r)(VH /(('^+^)[B(o.fl)])- We recall (T+l?)[B(o,i?,)] 
is the restriction of the array (T + V) to i3(0, R). 

Theorem 23 (Curtis-Lyndon-Hedlund iLMj). Any topological factor map cj) be- 
tween N'^-subshifts is a sliding block code. 

Proposition 24. Consider [Y, N"^) a minimal non-periodic substitution subshift 
and {XjN"^) a N'^-subshift. Let cj) : Y ^ X be a factor map. Then, there exists L 
and Rf such that for all R- cubic-pattern P, the set 4>~^(\P]x) is the union of at 
most L {R + Rf)- cubic- cylinders. 

Proof. From Proposition [20l (F, N'') is linearly recurrent with some constant K. 
The factor map (j) is given by a i?/-sliding block code / : B^'^^'^f '' A. The 
N'^-subshift (X, N'^) is clearly linearly recurrent with the constant K: Let K' be 
the constant given by Proposition!^ 
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Let P G L{X) be a i?-cubic-pattern with R > ma.x{Rf,2K'). The continuity of (p 
together with the Cantor structure of F imply (p^^{[P]x) — ^f^i[Pi]Y where the Pi 
are cubic patterns of Y. We can consider that the Pi are {Rj + i?)-cubic-patterns 
(this is the Curtis-Lyndon-Hedlund theorem). Take a {K{Rf + _R))-cubic-pattern 
Q of Y. By definition of the hnear recurrence, aU {Rf + i?)-cubic-patterns have 
an occurrence in Q and a fortiori aU Pi. Proposition [21] says that the difference ~x 
between two occurrences of P is such that \~x\ > R/K'. Hence if ~v' is the occurrence 
of some Pi and Uj is the occurrence of some Pj, then |V — | > R/K'; i.e there is 
at most one occurrence of some Pi in the (i?/i4r')-cubic-patterns. Consequently, the 

number of occurrences of the Pi in Q is at most ^^f/^Jx'y^ ■ Hence n is bounded 
which concludes the proof. □ 

6.9. Proof of the necessary part in the primitive case. 

Theorem 25. Let (Xi, W^) and (X2, N'') he two minimal suhshifts generated respec- 
tively by a p- substitution and a q-substitution. If they have a common non-periodic 
n'^-suhshift factor {Z,W) then p and q are multiplicatively dependent. 

Proof. From Theorem [22] and Proposition [24] there exist two finite sets Fi and F2 
such that 

F = {ireqziP); [P]z cubic-cylinder of Z} C {UnefiFir-") D {UneNF2r-") . 

The subshift [Z, N"*) being non-periodic, the set F is infinite. Hence, using the 
pigeon hole principle, there exist fi G Fi, i — 1,2, and four positive integers ni ^ n2 
and mi ^ m2 with 

flTi = /2?-2 % i = 1,2. 

Consequently, r'l' = r™i . □ 

6.10. Concluding discussion. We did not present a proof of the sufficient part of 
Cobham Semenov theorem because we did not find a symbolic proof which would 
be better than the "logical" one in jMVj (see also BHMV] ). 

Note that other Cobham type theorems exist. In jDu5] the substitutive version of 
Cobham's theorem is extended to what is called "good substitutions" . We strongly 
believe it can be extended to all substitutions but it remains an open problem. In 
[Du3j Cobham's first theorem is generalized to non standard numeration systems 
like Fibonacci numeration system : Un+2 = Un+i + Um Ui = 2, Uq = 1. This 
certainly could be extended to other numeration systems (see [LRj for example). 
In a different direction, a Cobham's type theorem can be found in [HRSj . In this 
paper they consider R'^-translation orbit closure of self similar tilings. (In our paper 
we consider N'^-translation orbit closure of tilings generated by multidimensional 
constant length substitutions.) The goal of their paper is not related to Cobham's 
theorem but the main argument they establish (Proposition 3 of [HRS| ) is clearly 
a Cobham's theorem for self similar tilings. 
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